Problem Note 63952: Queries of Hadoop tables return "WARNING: The following column could have a length in SAS of 32767..." and cause performance problems

When you query Hadoop tables, you might encounter performance problems and see a warning similar to the following issued in the SAS® log:

WARNING: The following column could have a length in SAS of 32767. If so, SAS performance is
         impacted. See SAS/ACCESS documentation for details.  The column read from Hive
         followed by the maximum length observed was:  col_v1:2

This problem occurs because SAS maps STRING columns, and columns of other complex data types, to the maximum possible character length, CHAR(32767), regardless of the actual length of the data. As a result, queries run slowly and the SAS tables that are created are unnecessarily large.

To work around this issue, use either the DBMAX_TEXT= option (available as both a LIBNAME statement option and a data set option) or the DBSASTYPE= data set option. Both options control column lengths. If you specify the DBMAX_TEXT= option in the LIBNAME statement that connects to Hadoop, the value is applied to all character columns. By contrast, you must set the DBSASTYPE= data set option individually for each STRING column.
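
The two workarounds can be sketched as follows. This is a minimal illustration, not a definitive configuration: the server name, port, schema, libref (hdp), table name (web_logs), and the lengths 255 and 20 are all hypothetical placeholders, and your site might require additional connection options. Only the option names and the column name col_v1 come from this note.

  /* Workaround 1: DBMAX_TEXT= in the LIBNAME statement.          */
  /* The length applies to the connection as a whole.             */
  /* Server, port, schema, and libref are hypothetical.           */
  libname hdp hadoop server="hive.example.com" port=10000 schema=default
          dbmax_text=255;

  /* Workaround 2: DBSASTYPE= data set option.                    */
  /* Specify a length for each STRING column individually.        */
  /* web_logs is a hypothetical table; col_v1 is the column that  */
  /* is named in the warning above.                               */
  data work.web_logs;
    set hdp.web_logs(dbsastype=(col_v1='CHAR(20)'));
  run;

When you choose a length, use the maximum length that the warning reports after the column name (2 for col_v1 in the example warning) as a lower bound, because longer values are likely to be truncated.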

Click the Hot Fix tab in this note to access the hot fix for this issue.

After you install the hot fix, a new environment variable, SAS_HADOOP_FAIL_32767, is available. When this variable is set, queries of Hadoop tables that contain columns with STRING or other complex data types fail. You must set the environment variable before you issue the LIBNAME statement that connects to Hadoop. If you set it after the LIBNAME connection is made, the behavior remains unchanged.

You can set the environment variable by using any of the usual methods (for example, at the operating-system level before SAS starts). To set the environment variable in a SAS session, use this syntax:

  options set=SAS_HADOOP_FAIL_32767="1"; 
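
The ordering requirement described above can be sketched as follows. The server name, port, schema, and libref are hypothetical placeholders. Only the environment variable name and its value come from this note.

  /* Step 1: Set the environment variable first. */
  options set=SAS_HADOOP_FAIL_32767="1";

  /* Step 2: Assign the libref afterward so that the connection   */
  /* picks up the setting. Connection values are hypothetical.    */
  libname hdp hadoop server="hive.example.com" port=10000 schema=default;

  /* Optional: write the current value to the SAS log to confirm  */
  /* that the variable is set in this session.                    */
  %put SAS_HADOOP_FAIL_32767=%sysget(SAS_HADOOP_FAIL_32767);

If the libref is already assigned when you set the variable, clear it and reassign it so that the LIBNAME statement runs after the OPTIONS statement.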


Operating System and Release Information

Product Family: SAS System
Product: SAS/ACCESS Interface to Hadoop

System | Product Release (Reported) | Product Release (Fixed*) | SAS Release (Reported) | SAS Release (Fixed*)
Microsoft® Windows® for x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 8 Enterprise 32-bit | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 8 Enterprise x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 8 Pro 32-bit | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 8 Pro x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 8.1 Enterprise 32-bit | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 8.1 Enterprise x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 8.1 Pro 32-bit | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 8.1 Pro x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows 10 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows Server 2008 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows Server 2008 R2 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows Server 2008 for x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows Server 2012 Datacenter | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows Server 2012 R2 Datacenter | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows Server 2012 R2 Std | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Microsoft Windows Server 2012 Std | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Windows 7 Enterprise 32 bit | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Windows 7 Enterprise x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Windows 7 Home Premium 32 bit | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Windows 7 Home Premium x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Windows 7 Professional 32 bit | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Windows 7 Professional x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Windows 7 Ultimate 32 bit | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Windows 7 Ultimate x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
64-bit Enabled AIX | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
64-bit Enabled Solaris | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
HP-UX IPF | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Linux for x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
Solaris for x64 | 9.43 | 9.45 | 9.4 TS1M3 | 9.4 TS1M5
* For software releases that are not yet generally available, the Fixed Release is the software release in which the problem is planned to be fixed.